Score Standardization for Robust Comparison of Retrieval Systems

نویسندگان

  • William Webber
  • Alistair Moffat
  • Justin Zobel
چکیده

Information retrieval systems are evaluated by applying them to standard test collections of documents, topics, and relevance judgements. An evaluation metric is then used to score a system’s output for each topic; these scores are averaged to obtain an overall measure of effectiveness. However, different topics have differing degrees of difficulty and differing variability in scores, leading to inconsistent contributions to aggregate system scores and problems in comparing scores between different test collections. In this paper, we propose that per-topic scores be standardized on the observed score distributions of the runs submitted to the original experiment from which the test collection was created. We demonstrate that standardization equalizes topic contributions to system effectiveness scores and improves inter-collection comparability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A magnetorheological fluid damper for robust vibration control of flexible rotor-bearing systems: A comparison between sliding mode and fuzzy approaches

Squeeze Film Dampers (SFD) are commonly used for passive vibration control of rotor-bearing systems. The Magnetorheological (MR) and Electrorheological (ER) fluids in SFDs give a varying damping characteristic to the bearing that can provide active control schemes for the rotor-bearing system. A common way to model an MR bearing is implementing the Bingham plastic model. Adding this model to th...

متن کامل

Delay-Dependent Robust Asymptotically Stable for Linear Time Variant Systems

In this paper, the problem of delay dependent robust asymptotically stable for uncertain linear time-variant system with multiple delays is investigated. A new delay-dependent stability sufficient condition is given by using the Lyapunov method, linear matrix inequality (LMI), parameterized first-order model transformation technique and transformation of the interval uncertainty in to the norm ...

متن کامل

Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature

Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...

متن کامل

A Robust Retrieval System of Polyphonic Music Based on Comparison of Chord Sequences

Retrieval systems for polyphonic music rely on the automatic estimation of the similarity between two musical pieces. In the case of symbolic music, existing systems consider a monophonic reduction based on melody or propose algorithms with high complexity. In this paper, a new approach is presented. Musical pieces are represented as a sequence of chords estimated from groups of notes sounding ...

متن کامل

Identifying and Ranking the Important Textual and Paratextual Elements in Fiction Retrieval

Purpose: The purpose of this study is to identify the textual and paratextual elements in retrieving fiction from the readers’ perspective in order to provide the most appropriate access points for the readers and to improve access to fictions based on the readers’ needs. Method: The current research is an applied study in terms of purpose, applying a mixed method that was conducted using the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007